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ABSTRACT 

A scheme for estimating atmospheric parameters Tgff, log g, and [Fe/H] is proposed 
on the basis of Least Absolute Shrinkage and Selection Operator (LASSO) algorithm 
and Haar wavelet. The proposed scheme consists of three processes. A spectrum is 
decomposed using the Haar wavelet transform and low-frequency components at the 
fourth level are considered as candidate features. Then, spectral features from the can¬ 
didate features are detected using the LASSO algorithm to estimate the atmospheric 
parameters. Finally, atmospheric parameters are estimated from the extracted spectral 
features using the support-vector regression (SVR) method. The proposed scheme was 
evaluated using three sets of stellar spectra respectively from Sloan Digital Sky Sur¬ 
vey (SDSS), Large Sky Area Multi-object Fiber Spectroscopic Telescope (LAMOST), 
and Kurucz’s model, respectively. The mean absolute errors are as follows: for 40 000 
SDSS spectra, 0.0062 dex for log Teff (85.83 K for Tgff), 0.2035 dex for log g and 
0.1512 dex for [Fe/H]; for 23963 LAMOST spectra, 0.0074 dex for log Tgff (95.37 K 
for Tgff), 0.1528 dex for log g, and 0.1146 dex for [Fe/H]; and for 10469 synthetic 
spectra, 0.0010 dex for log Tgff (14.42K for Tgff), 0.0123 dex for log g, and 0.0125 dex 
for [Fe/H]. 

Key words: methods: statistical-techniques: spectroscopic-stars: atmospheres- 
stars:fundamental parameters 


1 INTRODUCTION 


The implementation 
vey pr ograms, such 


of 


(SDSS: lYork et al.ll200(]|: 


and deep sky sur- 
Dig ital Sky Survey 
Ahi^et.^ 201J)^ Gaia-ESO Sur- 


large-scale 
the Sloan 


vey (GES: iGilmore et al l2012l : iRandich et all l2013l ) and 


the Large Sky Area Multi-object Fiber S pectroscopic Tele¬ 
scope (LAMOS T/Guoshoujing Telescope: IZhao et al.ll2od^ : 
iGui et aP l2012l ) , has resulted in the collection of a large 
amount of stellar spectra. The bulk of this stellar spectral in¬ 
formation necessitates utilisation of a fully automated char¬ 
acterisation process. 

In particular, estimating the effective temperature, sur¬ 
face gravity, and met allicity from stellar spectra rema ins a 
fundame ntal problem (Wu et al.lf201ll : [Song et al.ll2012l ). For 
example. iMuir head et all ( 20121 ) investigated the estimation 
of the effective temperature Teff and metallicity [M/H] for 
late-K and M-type planet-candidate host stars from the K- 
band spectra released by the Kepler Mission. In this study, 
a surface fitting method was used along with three spec¬ 
tral indices, namely the equivalent widths of Nal (2.210 
^m) and Gal (2.260 pm) lines and an index describing the 
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flux change between three 0.02pm wide bands centred at 
2.245, 2.370, and 2.080 pm. iKoleva et al.l ll2009l ) developed 
a full-spectrum fitting package, ULySS (University of Lyon 
Spectroscopic analysis Software), and explored i t s app lica- 
tion in spectral parameterization. iManteiga et al.l (l2010l ) pa- 
rameterised stellar spectra by extracting features based on 
Fourier analysis and wavelet decomposition as well as by 
constructing a mapping from feature space to parameter 
space using a forward neural networks (FNN) with three 
layers. 


An automated estimation system for atmospheric par¬ 
ameters from a stellar spectrum is also referred to as a 
spectrum-parameterisation system (SPS) in related stud¬ 
ies. An SPS system consists of two key processes, namely 
feature extraction and parameter estimation. Feature ex¬ 
traction determines representation of sp ectral information. 
Principal component a nalysis (PGA: iRe Fiorentin et ^ 
2007j;_ Bu_aj]dYanl 20151 ) and fast Fourier transform (FFT: 


Manteiga et al.l 20101 ) are the two most frequently used fea¬ 


ture extraction methods. Parameter estimation constructs 
a mapping from the extracted features of a spectrum to 
its atmospheric parame ters. Some commonly us ed estima¬ 
tion methods are FNN (iRe Fiorentin et al.ll2007l ). Gaussian 
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proces s (GP: Bu and ParJl2015h . support-vector regression 
(SVRi lLi et all 2014 ). etc. 

This work describes a scheme for estimating atmo¬ 
spheric parameters from the stellar spectrum. The three pro¬ 
cesses of the proposed scheme are performed as follows. Sev¬ 
eral candidate features are initially obtained by transform¬ 
ing a spectrum into a low-frequency space using the Haar 
wavelet transform. Then, some representative candidate fea¬ 
tures are selected as spectral features using the Least Abso- 
lute Shrinkage an d Selection Operator (LASSO) algorithm 
dTibshiranilligod l. Finally, atmospheric parameters are esti¬ 
mated from extracted features using a regression method. 

High-frequency components are removed in the first 
process. These components are usually more affected by 
noise than low-frequency components, which will be dis¬ 
cussed further in section [El The second process re¬ 
duces the number of spectral features efficiently, a fea¬ 
ture closely related to the estimation efficiency and explo¬ 
ration of representative features fSection 15.311 . In the third 
process, the spectrum parameterization problem is investi¬ 
gated using four typical estimation methods: the radial basi s 
function neural network (RBFNN : |SchjTOnkerefL_alJ |20(llf^ 
SVR. (IChang fc Liniliooll : ISchokopf et al.ll200^ Smola^t^ 


2004), A-nearest neighbor regression (K NNR:|AJtman_eLaL, 
1992 ) and least square regression (LSR: [James et al.ll2013h . 
Experimental results indicate that SVR is superior to the 
other three methods. 

This article is organised as follows. Section [2] describes 
some applied experimental data. After introducing four re¬ 
gression methods in Section [Hj Section [T] describes a scheme 
for extracting spectral features. In Section [E the proposed 
scheme is evaluated experimentally. Finally, Section [1 con¬ 
cludes this work 


radial velocity) using the radial velocity provided by 
the SDSS/the Sloan Extension for Galactic Understand¬ 
ing and Exploratio n (SEGUE) Spe c troscopic Param e- 


ter Pipeline (SSPP: Beers et al. 

20061: Lee et al.l 2008a[[bl. 

[ 2011 ]: lAllende Prieto et al.l 20081: 

Smolinski et al.|[2011^ and 


rebinned to a maximal common log(wavelength) range 
[3.581862, 3.963961] with a sampling step of 0.0001. 
The common wavelength range is approximately [3818.23, 
9203.67]A. The SDSS training and test sets are labeled as 
StL' and SfL’, respectively. Without special instructions, 
the sizes of and StL' are 10 000 and 40 000, respec¬ 
tively. 


2.2 Actual spectra from LAMOST 

A set of 33 963 LAMOST stellar spectra |[Luo et al.|[2012fl 
and their physical parameters from LAMOST pipeline are 
selected based on two signal-to-noise ratio (SNR) con¬ 
straints: name ly SNRg ^ 20 and SNR,- ^ 20 in gr—band 
and r—band (ILuo et al.l l2012fl Q. This constraint is pro¬ 
posed by considering the current quality stability of the 
LAMOST spectra. The LAMOST spectra are also shifted 
to their rest frames using the radial velocity provided by 
LAMOST SSPP and rebinned to their maximal common 
log(wavelength) range [3.5845, 3.9567] with a sampling step 
of 0.0001. The common wavelength range is approximately 
[3841.49, 9051.07]A. All LAMOST spectra span the ranges 
[3853.2, 9927] K for Teff, [0.8920, 4.9959]dex for log g and 
[-2.3280, 0.9360] dex for [Fe/H]. The LAMOST training and 
test sets are labeled as and SiY, respectively. StY con¬ 
sists of 10,000 stellar spectra, while consists of 23,963 
spectra. 


2 DATA 

The proposed scheme was evaluated by performing exper¬ 
iments using three data sets: 33 963 actual spectra from 
LAMOST, 50 000 actual spectra from SDSS and 18 969 
synthetic spectra computed from Kuruczs model. Actual 
spectra usually carry some disturbances from noise and pre¬ 
processing imperfections. 

The proposed scheme is a statistical learning method. It 
basically involves the discovery of mapping from stellar spec¬ 
tra to their atmospheric parameters using empirical data, 
which is referred to as a training set in machine learning. At 
the same time, the performance of the mapping discovered 
should be evaluated objectively. Therefore, an independent 
set of stellar spectra is needed for this evaluation. This in¬ 
dependent set is usually referred to as a test set. Thus, in 
each experiment, stellar spectra are split into two subsets, 
namely the training and test sets. 


2.1 Actual spectra from SDSS 

A set of 50 000 stellar spectra and the ir physical par¬ 
amet e rs collected f r om S DSS are selected (lAbazaiian et al] 
I2OO9I : [Yannv et al.l [200^. The selected spectra span the 
ranges of [4088, 9740] K for Teff, [1.015, 4.998] dex 
for surface gravity and [-3.497, 0.268] dex for [Fe/H]. 
All stellar spectra are shifted to their rest frames (zero 


2.3 Synthetic spectra 


A set of 18 969 synthetic spectra are calculated from 
Kurucz’s NEWODF (new opaci ty distribution function) 


models ([Gastelli &: Kurucz 


(v2.76) package I Gray et al.l 1994 1 as well as 830 828 atomic 


2003) using the SPEGTRUM 


and molecular lines (contained in two files, luke.lst and 
luke.nir.lst). These atomic and molecular data are stored in 
the fi le stdatom.dat, which includes solar atomic abundances 
from iGrevesse et al.l lll998li . The SPECTRUM package and 
the three data files can be downloaded from website Q 

The grids of the synthetic stellar spectra span parame¬ 
ter ranges [4000,9750] K for Teff (45 values, stepsizes of 100 
K between 4000 K and 7500 K and 250 K between 7750 K 
and 9750 K), [1, 5] dex for log g (17 values, stepsize of 0.25 
dex) and [-3.6, 0.3] dex for [Fe/H] (27 values, stepsizes of 
0.2 dex between -3.6 dex and -1 dex and 0.1 dex between 
-1 dex and 0.3 dex). The synthetic stellar spectra are also 
split into two subsets: a training set SfY and a test set StY 
consisting of 10 469 and 8 500 spectra, respectively. 


^ This constraint was not used on SDSS data. 

^ http://www.appstate.edu/ grayro/spectrum/spectrum.html 
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3 ESTIMATION MODELS AND EVALUATION 
METHODS 

Here, x represents a description of a stellar spectrum, while 
y is the atmospheric parameter Teff, log g, or [Fe/H] of x. 

3.1 Estimation models 

The spectrum parameterization problem is to recover the 
mapping /: 

y = fix) (1) 

from a set of empirical data (training set). 


In this work, / is esti mated using four typi cal re¬ 


Eression methods: RBFNN (ISchwenker et al.l 

2001 IL SVR 

llChans & LinI 

2001: Schokoof et al. I 2 OO 2 I: 

Smola et al. 

2004 

1, KNNR jAltman et al.lll992fl and LSR 1 

James et al. 

201 a 

ods. 

). RBFNN and KNNR are non-linear regression meth- 
while LSR is a linear regression method. SVR can be 


implemented with a Gaussian kernel and a linear kernel. 
These two implementation cases are labelled as SVRg and 
SVRj, respectively. Therefore, SVRg is a nonlinear method 
whereas SVR; is a linear method. 

3.2 Evaluation criteria 

Suppose that / is an estimate of the spectrum parameteriza¬ 
tion mapping / and S' = {(a:, j/)} is a data set, where a: is a 
representation of a stellar spectrum and y is an atmospheric 
parameter of the corresponding star. The data set S can be 
a training or test set. 

For convenient comparison with related reports, this 
study evaluates the performance of an estimate / using three 
methods: mean error (ME), mean of absolute error (MAE) 
and standard deviation (SD). These thr ee evaluation meth¬ 
ods are widely used in related studies dRe Fiorentin et al.l 
120071: 1 Jofre et al.ll2010l:lTan et al.ll2013h . 

Using the data set S, the three evaluation criteria can 
be defined as follows: 

= i iy-fi^))’ (2) 

(a2,y)GS 

\y-f{x)\, (3) 

(a:,y)es 


= E iy-fix))L (4) 

V 

where n is the amount of elements in S. 

4 FEATURE EXTRACTION 

4.1 Extract Candidate Features 

Atmospheric parameters Teff, log g, and [Fe/H] show an ev¬ 
ident non-linear de pendence on st ellar spectra (Table 6, Ta¬ 
ble 10, Table 11 in iLi et al.ll2014l l. Therefore, the spectrum 
parameterization problem is investigated by non-linearly 
transforming a spectrum before estimating atmospheric par¬ 
ameters. In this work, stellar spectra are transformed using 


a Haar wavelet llMaIlatll2009l l and decomposed into a series 
of components with different wavelengths and frequencies 
(time-frequency localization). 

High-frequency components are usually more affected 
by noise than low-frequency components. Thus, this work 
obtains candidate features by removing high-frequency com¬ 
ponents. This process will be discussed further in Section 
15.31 

In addition to the Haar wav elet, multiple alternatives 
for a non-linear transformation (iMallatl l2009l : iDaubechiesI 
I 1992 II . such as the Coifiets, Daubechies, Symmlet and 
biorthogonal wavelets, are available based on wavelet trans¬ 
form. 


4.2 Refining the candidate features 

Experiments indicate the presence of many redundancies in 
the extracted candidate features (Section 15.31) . Therefore, 
this work proposes a scheme for detecting spectral features 
from the extracted candid ate features using the LASSO al¬ 
gorithm (lTibshiranilll996ll . 

Let Str = {(a:, j/)} be a training set (Section [2]), where 
X = {xi,--- ,Xm) represents a stellar spectrum based on 
its candidate features, y is an atmospheric parameter of the 
corresponding star and m is a positive integer. The LASSO 
algorithm selects features using the following model: 

m m 

w^argimn{ ^ ( 1 /- ^ -I-A ^ jjrci jj}, (5) 

(a2,y)^S i=l i=l 

where A > 0 is a preset parameter. In this model, only a few 
Wi will be non-zero, and the variables Xi with a non-zero Wi 
are selected as spectral features. The parameter A controls 
the amount of non-zero parameters Wi or, equivalently, the 
number of detected features. In this wor k, the parameter 
A is estimated by ten fold cross validation (iTibshiranill 19961 : 
ISiostrand et ahlfeoPSl L 

Based on the training set from SDSS (Section 12.11) . 17 
spectral features are detected for Teff, 24 spectral features 
for log g, and 25 features for [Fe/H] (Table [T]). 


5 EXPERIMENTS AND DISCUSSION 

5.1 Performance for SDSS spectra 

From the detected features in Table [T1 a spectrum parame¬ 
terization model can be learned from the training set StS 
(Section 12.11) . The performance obtained using the test set 
StS is presented in Tabled 

For the SDSS test set, MAE errors are 0.0062, 0.2035 
and 0.1512 dex for log Teff, log g and [Fe/H], respectively. To 
compare the proposed scheme with those in previous related 
reports, the performance of the proposed scheme was also 
evaluated using measures ME and SD. More experimental 
results are presented in Table [2] as well as Figs. [T] and 
Direct comparisons with reports are shown in Section[^and 
more discussion on the dispersion in Fig. [T] is presented in 
Section EH 
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Table 1. Extracted features for estimating atmospheric parameters. WP is the wavelength position represented by a two-dimensional 
vector [a, b], where a and b are the starting and ending wavelengths (A). 


(a) Extracted features for estimating Teff. 

Label 

WP(A) 

Label 

WP(A) 

Label 

WP(A) 

Label 

WP{A) 

Ti 

[3932.439, 3946.045] 

T2 

[4217.567, 4232.159] 

Ts 

[4780.375, 4796.915] 

T 4 

[4851.343, 4868.128] 

Ts 

[5033.406, 5050.821] 

Te 

[5070.631, 5088.175] 

T 7 

[5108.131, 5125.804] 

Ts 

[5126.984, 5144.723] 

Tg 

[5145.908, 5163.712] 

Tio 

[5164.901, 5182.771] 

Til 

[5203.098, 5221.100] 

Ti2 

[6562.389, 6585.094] 

Ti3 

[8524.364, 8553.857] 

Ti4 

[8650.914, 8680.845] 

Tis 

[8747.058, 8777.321] 

Tie 

[8844.270, 8874.870] 

Ti7 

[9008.697, 9039.866] 







(b) Extracted features for estimating log g. 

Label 

WP(A) 

Label 

WP(A) 

Label 

WP(A) 

Label 

WP(A) 

Li 

[3818.229, 3831.440] 

L 2 

[3889.215, 3902.672] 

L 3 

[3932.439, 3946.045] 

L 4 

[4095.076, 4109.245] 

Ls 

[4295.978, 4310.841] 

Le 

[4540.064, 4555.772] 

L? 

[4556.821, 4572.587] 

Lg 

[4573.640, 4589.464] 

Lg 

[4658.671, 4674.789] 

Lio 

[4833.503, 4850.226] 

Lii 

[4851.343, 4868.128] 

Li2 

[4869.249, 4886.096] 

Li3 

[4887.221, 4904.130] 

Li4 

[4923.365, 4940.399] 

Li5 

[5164.901, 5182.771] 

Lie 

[5183.964, 5201.900] 

Li7 

[5222.302, 5240.371] 

Li8 

[5241.577, 5259.712] 

Lig 

[5280.341, 5298.611] 

L 20 

[5299.831, 5318.167] 

Lgi 

[5319.392, 5337.796] 

L 22 

[5418.287, 5437.033] 

L 23 

[5498.725, 5517.750] 

L 24 

[6562.389, 6585.094] 

(c) Extracted features for estimating [Fe/H]. 

Label 

WP(A) 

Label 

WP(A) 

Label 

WP(A) 

Label 

WP{A) 

Fi 

[3932.439, 3946.045] 

F2 

[3990.819, 4004.626] 

Fs 

[4005.549, 4019.407] 

F 4 

[4020.333, 4034.242] 

Fs 

[4035.172, 4049.133] 

Fe 

[4506.735, 4522.327] 

Ft 

[4607.465, 4623.406] 

Fs 

[4745.282, 4761.700] 

Fg 

[4780.375, 4796.915] 

Fio 

[4798.019, 4814.620] 

Fii 

[4815.729, 4832.390] 

Fi2 

[4851.343, 4868.128] 

Fi3 

[4869.249, 4886.096] 

Fi4 

[4941.536, 4958.633] 

Fis 

[4959.775, 4976.935] 

Fie 

[5051.984, 5069.463] 

Fi7 

[5108.131, 5125.804] 

Fi8 

[5241.577, 5259.712] 

Fig 

[5260.924, 5279.126] 

F 20 

[5280.341, 5298.611] 

Fgi 

[5299.831, 5318.167] 

F 22 

[5398.362, 5417.039] 

F 2 S 

[5438.285, 5457.101] 

F 24 

[8524.364, 8553.857] 

F 25 

[8650.914, 8680.845] 








Table 2. Performance of the proposed scheme on 40 000 test spectra from SDSS (10 000 SDSS spectra for training, Section l2.1|l 


Method 

Parameter 

MAE 

log Teff(Teff) 
ME 

SD 

MAE 

log 9 
ME 

SD 

MAE 

[Fe/H] 

ME 

SD 

RBFNN 

0.0065(88.48) 

4.42xl0-'‘(6.28) 

0.0107(148.04) 

0.2159 

0.0205 

0.3228 

0.1547 

6.04x10-4 

0.2197 

SVRg 

0.0062(85.83) 

6.05xl0-'‘(9.40) 

0.0101(146.66) 

0.2035 

-0.0193 

0.3053 

0.1512 

1.19x10-2 

0.2158 

KNNR 

0.0069(94.77) 

-8.39xl0-"‘(-10.13) 

0.0109(154.62) 

0.2178 

-0.0370 

0.3069 

0.2198 

-3.56x10-2 

0.2999 

LSR 

0.0072(99.22) 

3.45x10-4(5.46) 

0.0111(160.73) 

0.2594 

0.0270 

0.3574 

0.1786 

3.61x10-3 

0.2472 

SVRi 

0.0070(96.77) 

3.41x10-4(7.11) 

0.0111(162.79) 

0.2417 

0.0475 

0.3648 

0.1758 

-8.62x10-3 

0.2466 


Notes. The unit for Tea is K; The unit for logTeff is log(K). 





Figure 1. Performance of the proposed scheme on 40 000 test spectra from SDSS (10 000 SDSS spectra for training, Section l2.1ll using 
SVRg 
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(a) log Teff (b) log g (c) [Fe/H] 

Figure 2. Residual distributions of the proposed scheme on 40 000 test spectra from SDSS (10 000 SDSS spectra for training, Section 

[O 


5.2 Performance for LAMOST spectra and 
synthetic spectra 

The proposed scheme is also tested on actual spectra from 
LAMOST (section |2.211 and synthetic spectra (Section 12.311 . 

The performances of the scheme on LAMOST spectra 
is presented in Table |3l as well as Figs. |3] and [4] The test 
results using synthetic spectra are presented in Table [4] as 
well as Figs. [S] and 1^ The results in Tables [21E] and [T] show 
that the SVRg and RBFNN are more suitable than KNNR, 
LSR and SVR; for estimating atmospheric parameters. 

5.3 Filtering and Selection - Positive or Negative? 

The proposed scheme extracts spectral features by remov¬ 
ing high-frequency components from the Haar wavelet trans¬ 
form and rejecting most low-frequency components by the 
LASSO algorithm. In this study, we determine whether these 
processes eliminate important spectral information, such as 
weak lines. 

Four experiments are conducted and the results are 
listed in TableO The results show the possibility of eliminat¬ 
ing useful spectral information. However, in application, the 
observed spectrum is inevitably contaminated with noise. In 
theory, weak lines should be more sensitive to noise. 

Therefore, the loss of the elimination is trivial. The 
wavelet components with the lowest frequency are tradi¬ 
tional choices for s pectral feature s for estimating atmo¬ 
spheric parameters (ILu et al.ll2013ll . In the experiments for 
Teff, when all low-frequency wavelet components are used, 
the number of features will increase from 17 to 239 (increase 
(239-17)/17 = 1305.88 percent), but the MAE can only de¬ 
crease by 0.0007 dex (11.29 percent: Experiment 1 and 2). 
When no component is eliminated while estimating Teff, 
the number of features will increase from 17 to 3823 (an 
increase of 22388.23 percent), but the MAE error increases 
by 0.0398 dex (641.93 percent). A small number of detected 
features indicates an efficient process for estimating atmo¬ 
spheric parameters from spectral features. The above results 
suggest that the model developed in this work can estimate 
stellar atmospheric parameters with high accuracy. 

5.4 Knowledge base, dispersion and performance 

The proposed scheme is a statistical learning method. Its 
primary principle is to discover automatically the mapping 


from a stellar spectrum to its atmospheric parameters from 
a training set. The training set is the carrier of knowledge 
and affects the accuracy of the scheme. 

Therefore, the size of the training set affects the perfor¬ 
mance of the proposed scheme. For example, if the size of 
the training set is increased from 10 000 to 15 000, 20 000, 
25 000 in the experiments on SDSS spectr43, the test disper¬ 
sion can be clearly improved (Fig. (7)). Similar experiments 
are conducted on synthetic spectra and the corresponding 
results are presented in Fig. jS] 

Actuall data usually present some disturbances arising 
from noise and pre-processing imperfections (e.g., sky lines 
and/or cosmic ray removal residuals, residual calibration de¬ 
fects and interstellar extinction instability 0). The negative 
effect from these factors can be reduced to a certain extent 
by enriching the knowledge carrier, i.e. the training set (Figs. 
[7] and jS]) . 


5.5 Compactness 

For simplicity, this section considers the features of log g as 
an example to discuss compactness. Other features of Teff 
and [Fe/H] can be discussed similarly. 

The original SDSS spectra are described by 3,821 fluxes. 
To estimate log g, 24 features are detected and data reduc¬ 
tion is (3821 - 24)/(3821 )r:; 99.37 percent. This result indi¬ 
cates that log g can be estimated from a spectral description 
with a dimension of 24 instead of 3821. The small number 
of features also implies high efficiency in estimating the pa¬ 
rameter from spectral information. 

Compactness indicates the study of whether the num¬ 
ber of features can be reduced further. Experimental results 
show that if seven features, namely LI, L8, L13, L19, L20, 
L21 and L23, are rejected, the feature number decreases by 
7/24 = 29 percent and the MAE error increases by 0.0027 
dex (approximately 0.0027/0.2035 « 1.32 percent) |j There¬ 
fore, the feature number can be refined further if a slight 
decrease in accuracy is accepted. 


® Correspondingly, the size of the test set decreases from 40 000 
to 35 000, 30 000, and 25 000, respectively. 

By instability, we mean that a slight difference in the interstel¬ 
lar extinction of multiple stars may be observed. 

® These experiments are conducted on SDSS spectra (Section 
[241 . 
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Figure 3. Performance of the proposed scheme on 23,963 test spectra from LAMOST (10,000 LAMOST spectra for training. Section 
Vr2\ using SVRg 



(a) log Teff (b) log g (c) [Fe/H] 

Figure 4. Residual distributions of the proposed scheme on 23,963 test spectra from LAMOST (10000 LAMOST spectra for training. 
Section using SVRg 





Figure 5. Performance of the proposed scheme on 10,469 test spectra computed from KURUCZ’s model (8,500 synthetic spectra for 
training. Section [Oil using SVRg 



Delta logTeff Delta log g Delta [Fe/H] 

(a) log Teff (b) log g (c) [Fe/H] 


Figure 6. Residual distributions of the proposed scheme on 10,469 test spectra computed from KURUCZ’s model (8,500 synthetic 
spectra for training, Section lot using SVRg 
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Table 3. Performance of the proposed scheme on 23 963 test spectra from LAMOST (10 000 LAMOST spectra for training, Section [2.2|l 


Method 

Parameter 

MAE 

log Teff (Teff) 
ME 

SD 

MAE 

log 9 
ME 

SD 

MAE 

[Fe/H] 

ME 

SD 

RBFNN 

0.0070(91.14) 

6.75x10-5(2.37) 

0.0099(131.36) 

0.1664 

0.0109 

0.2753 

0.1197 

-0.0038 

0.1767 

SVRg 

0.0074(95.37) 

1.27xl0-"‘(4.30) 

0.0106(141.62) 

0.1528 

-0.0008 

0.2102 

0.1146 

-0.0112 

0.1528 

KNNR 

0.0085(111.80) 

4.47x10-4(10.08) 

0.0126(173.66) 

0.1934 

0.0167 

0.2730 

0.1625 

-0.0154 

0.2151 

LSR 

0.0082(106.51) 

6.67x10-4(11.46) 

0.0117(161.83) 

0.2218 

0.0173 

0.3404 

0.1312 

-0.0143 

0.1807 

SVRi 

0.0081(105.73) 

1.89x10-4(5.44) 

0.0116(159.83) 

0.2070 

-0.0124 

0.3145 

0.1311 

-0.0222 

0.1806 


Notes. The unit for Teff is K; The unit for logTeff is log(K). 


Table 4. Performance of the proposed scheme on 10 469 test spectra computed from KURUCZ’s model (8 500 synthetic spectra for 
training, Section [Oil 


Method 

Parameter 

MAE 

log Teff(Teff) 
ME 

SD 

MAE 

log 9 

ME 

SD 

MAE 

[Fe/H] 

ME 

SD 

RBFNN 

0.0010 (14.15) 

1.34x10-4(1.42) 

0.0014(20.26) 

0.0217 

1.89x10-5 

0.0582 

0.0203 

1.95x10-5 

0.0282 

SVRg 

0.0010 (14.42) 

2.63x10-4(3.34) 

0.0015(20.81) 

0.0123 

-9.42x10-4 

0.0590 

0.0125 

-3.58x10-4 

0.0256 

KNNR 

0.0027(39.39) 

3.92x10-5(0.19) 

0.0041 (61.08) 

0.2167 

3.55x10-2 

0.3166 

0.1007 

2.90x10-2 

0.1611 

LSR 

0.0026 (36.18) 

-2.54xl0-4(-3.68) 

0.0033(46.05) 

0.1416 

2.36x10-2 

0.1902 

0.0903 

9.77x10-5 

0.1175 

SVRi 

0.0025 (34.91) 

1.14x10-4(1.79) 

0.0032 (45.80) 

0.1343 

2 .21x10-2 

0.1920 

0.0783 

4.12x10-5 

0.1122 


Notes. The unit for Teff is K; The unit for logTeff is log(K). 


6 CONCLUSION AND FUTURE WORK 

This work investigated estimation of effective temperature 
(Teff), surface gravity (log g) and metallicity ([Fe/H]) from 
stellar spectra based on the Haar wavelet transform and 
LASSO algorithm. The proposed scheme is evaluated using 
actual spectra from SDSS and LAMOST as well as synthetic 
spectra computed from Kurucz’s model. Favorable results 
are achieved in all cases. 

The proposed scheme exhibits excellent robustness and 
sparseness. The features are extracted using two steps. From 
the SDSS data, the original spectra, described by 3821 
fluxes, are initially decomposed using the Haar wavelet 
transform and low-frequency coefficients (239 features) are 
selected as candidate features. In this step, some noises and 
redundancies with high-frequency components are removed. 
Then a small subset of the candidate features is chosen as 
spectral features using the LASSO algorithm. The second 
step is a supervised learning process that selects features 
according to their correlation with the parameter to be es¬ 
timated. The number of selected features is 17 for Teff, 
24 for log g and 25 for [Fe/H]. A representative work is 
iRe Fiorentin et ^ ((2003), in which 50 features are extracted 
for estimating atmospheric parameters. 

Another advantage of the proposed scheme is its high 
accuracy. Using the SVRg method and 40 000 stellar spec¬ 
tra from SDSS, the MAEs are 0.0062 dex for log Teff 
(85.83 K for Teff), 0.2035 dex for log g and 0.1512 dex for 
[Fe/H] . Further details are show n in Table [2] In previous re¬ 
ports, iRe Fiorentin et ^ ll2007ll estimated the parameters 
of 19 000 spectra from SDSS with MAEs of 0.0126 dex for 
log Teff, 0 .3644 dex for log g and 0.1949 dex for [Fe/H]. 
Ijofre et al.l ll2010ll first highly compressed the data using a 
likelihood method and then estimated the parameters Teff, 
log g and [Fe/H] from low-resolution stellar spectra mea¬ 
sured by SEGUE; the standard deviations (SDs) of the er¬ 
rors were 130 K for Teff, 0.5 dex for log g and 0.25 dex 


for [Fe/H]. Therefore, the results estimated using the pro¬ 
posed scheme exhibit higher accuracy compared with those 
reported in literature. 

In this work, the proposed scheme is evaluated using 
three different data sets (SDSS, LAMOST and synthetic 
spectra). On each kind of data, the proposed model is 
learned and tested independently. An interesting problem 
is the estimation of atmospheric parameters of one data 
(e.g., LAMOST ) using a model learned from the other 
data sets(e.g., SD S S or synthetic spectra). For example, 
iRe Fiorentin et ^ (l2007l l investigated how to estimate the 
atmospheric parameters of SDSS data from synthetic spec¬ 
tra and vice versa. In this process, residual calibration de¬ 
fects should be considered. This work focuses on sparse fea¬ 
ture extraction and the abovementioned problem will be in¬ 
vestigated in future work. 
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Figure 7. Dispersion can be improved by increasing the size of training set from 10 000 to 15 000, 20 000, 25 000 in the experiments on 
SDSS test spectra. This experiment is conducted using SVRq. 
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